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Abstract 

The geometry of unit iV-dimensional £ p balls (denoted here by Bjv,p) has been 
intensively investigated in the past decades. A particular topic of interest has been the 
study of the asymptotics of their projections. Apart from their intrinsic interest, such 
questions have applications in several probabilistic and geometric contexts [BGMN05J. 
In this paper, our aim is to revisit some known results of this flavour with a new 
point of view. Roughly speaking, we will endow Btv, p with some kind of Dirichlet 
distribution that generalizes the uniform one and will follow the method developed in 
[Ski67] , [CKS93] in the context of the randomized moment space. The main idea is to 
build a suitable coordinate change involving independent random variables. Moreover, 
we will shed light on connections between the randomized balls and the randomized 
moment space. 

Keywords: Moment spaces, ^-balls, canonical moments, Dirichlet dis- 
tributions, uniform sampling. 
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1 Introduction 

The starting point of our work is the study of the asymptotic behaviour of the moment 
spaces: 

Mp 1] = \(f t^(dt)) : p e A^QO, 1]) 1 , (1) 

\\Jo / l<j<N J 
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where A^i([0, 1]) denotes the set of all probability measures on [0, 1]. These compact sets 
are randomized with the uniform distribution. By using cleverly an old Skibinsky's result 
QSki67]), the authors of the seminal paper [CKS93] show two very nice results. First they 
proved that, for large N, the sets (m]^' 1 ^) are almost concentrated in terms of finite dimen- 
sional projections on one point (the moments of the arcsine law). Secondly, they obtained a 
multidimensional CLT for the fluctuations of the finite dimensional projections. The asymp- 
totic covariance matrices only involve the moments of the arsine law. These results have 
been extended to large deviations asymptotics in |GLC04j . The main tool for the study of 
randomized moment spaces is the existence of a nice coordinate change leading to indepen- 
dent random variables. This coordinate change is obtained by the so-called Knothe map (see 
the proof of the first theorem p. 40 in |Kno57j ) that is available for any finite dimensional 
bounded convex body. The new coordinates are called canonical moments in the literature. 
We refer to the excellent book [DS97] for a complete overview on canonical moments. Notice 
that recently these results have been extended to matricial moment spaces (see [DS02J). 

In this paper we will focuss both on randomized iV-dimensional £ p balls (denoted by M NjP ) 
and randomized moment spaces. First the randomized ball will be studied by using the 
Knothe map. Surprisingly, as in the case of moment spaces, when the ball is endowed with 
the uniform distribution, the Knothe map leads to canonical coordinates that are also in- 
dependent random variables. Furthermore, we will show that this property remains true 
for a general family of distributions on the ball (called p-generalized Dirichlet distribution, 
see Section I2.6.2p . Independence will be the main tool to investigate the properties of the 
randomized balls. Indeed, it will enable to easily show general association results for the 
p-generalized Dirichlet distribution (see Section f2 . 6 . 3 p . Furthermore, with the help of the 
canonical coordinates we will study various Poincare-Borel like lemmas for these distribu- 
tions. That is, convergence and fluctuations of the projections when the dimension of the 
space increases (see Section [3]). 

There is a revival interest on the moment problem and on orthogonal polynomials on the 
torus T. It is also possible in this frame to define canonical moments. They are also some- 
times called Verblunsky coefficients (see for example [Sim0 5bj). Notice that these coefficients 
have a lot of properties. For example they are involved in the inductive equations of orthogo- 
nal polynomials construction. In this paper, we will discuss connections between randomized 
moment spaces (for moment problems on T), and the ball Bjv iP . This connection will be ob- 
tained through the canonical coordinates (see Section I4~2l) . 

The paper is organized as follows. In the next section, we recall some definitions and useful 
properties of Dirichlet distributions. We also recall the definition and some basic properties 
of the canonical moments on a compact interval. Then, building the same parametriza- 
tion for Mn, p we introduce and study generalized Dirichlet distributions on Btv, p (called 
the p-generalized Dirichlet distributions). The uniform distribution appears to be a special 
case. We also discuss the connections between these results and the so-called stick-breaking 
construction of the Dirichlet distributions. We end the section settling negative association 
properties for these distributions. In Section [3j we obtain asymptotic results for p-generalized 
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Dirichlet distributions. Let us notice that there is an extension of generalized Dirichlet dis- 
tributions in the context of matrix balls (see |Ner02j ). 

We give several applications of the representation on independent variables proving sev- 
eral versions of the Poincare-Borel lemma (see e.g. [Led96] ) working both with weak conver- 
gence and large deviations. Finally in the last Section, we discuss some connections between 
randomized balls and randomized moments spaces. 



2 Probabilities for moment sets and balls 

2.1 The Dirichlet world 

Let us recall some useful properties and definitions related to Dirichlet distributions (see 
for example |KotzOO] ) . A large class of laws on the unit ball may be built from the Dirichlet 
distributions. 

We use two definitions of simplices. For k > 1, we set 

<S k +i = {(aci, ■ • • ,x k+1 ) : Xi > 0, (i = 1, • • ■ , k + 1), x 1 H ha; fe+ i = l}, 

$k = {(xi,- ■ ■ ,Xk) ■ Xi> 0,(i = 1,- ■ ■ ,k), xi-\ \-x k <l}. 



It is clear that the mapping (xi, ■ ■ ■ , Xk+i) ^ (xi, ■ ■ ■ ,Xk) is bijection from S^+i onto 
For a\, ■ ■ • , <2fc + i > 0, the Dirichlet distribution Dir(ai, ■ ■ • , a k +i) on S k +i has the density 



r(ai j h Qfc+i ) ^.oi-i _ „«fc+i-i 

r(ai) • --rfak+i) 



with respect to the Lebesgue measure on Sp.+i- It can also be viewed as a distribution 
Dir fc (ai, • • • , a k ; a k+ i) on with density 

J {xi, ,x k) - r(ai) ... r(afe+i) H U *1 X k) 

The particular case ai = • • • = a£ + i = 1 is the uniform distribution on <S^. 

We recall that the family of Dirichlet distributions is stable by partial sum grouping, i.e. 
if (<ti, ■ • • , a m ) is a partition of {1, • • ■ ,k+ 1}, then 

(xi, • • • , x k+ i) = Dir(ai, • • • , o fc+ i) =4> (Xi, • • • , X m ) = Dir(Ai, • • • , A m ) (2) 
where X { = J2 j£Ut Xj and Ai = Yjj^ a r Moreover 

Xi X m ^i \ d 



Xx + • " + X m _i ' ' Xi + ■ ■ • + X m _i 



Dir(A ir -- ,A W _0 (3) 



and this vector is independent of X\ + ■ ■ • + X m -\. 

If ai = ■ ■ ■ = a k +i = a, we denote the distribution by Dirfc(a). If k = 2 the distribution 
Diri(a; b) on (0, 1) is the Beta(a, b) distribution. Sometimes we need the affine push forward 
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on (—1, +1) of this distribution. We denote this last distribution by Beta s (a, b). To end this 
preliminary let us recall the classical relation between Gamma and Dirichlet distributions. 
For a, A > 0, we say that Z ~ 7(0, A) whenever its distribution has the following density 

y a ~ l \ a 

K,x{y) ■= r , v exp(-Ay), (y > 0). 

r(a) 

We use frequently the slight abuse of notation 7(a) = 7(0, 1). 
If yi, % = 1, 2, • • • , r are independent and if yi — j(bi) then 

" J ^ ' Dir (&!,••• A) (4) 



H hy r 2/i H VVr, 

and this variable is independent of y\ H — • + y r . It is a generalization of the well known fact 

d l(a) 



Beta(a, b) 

where 7(a) and 7' (b) are independent. 

Let G p be the distribution on R with density 



7(a) + 7 '(6) 



.t 1 — y 7 r- e 



2r (1 + ; 



1 Ixl* 



It is the distribution of eZ 1 ^ where Z has the ^{p x ) distribution, and e is a Rademacher 
variable (P(e = ±1) = 0.5) independent of Z. 

2.2 Stick-breaking and generalized Dirichlet distributions 

The following classical model has been widely used in geometric probability genetics, 
Bayesian statistics, etc.... It leads to the fascinating area of random distributions (see King- 
man |Kin75] and Pitman [Pit06]). It is often known as stick-breaking. For the sake of 
consistency, let us explain the details of the construction. 

We define two sets of variables (Z 1; ■ • • , Z n ) G (0, l) n and (Pi, ■ ■ • , P n ) G S< connected 
by the system of equations 

Z 1 = P 1 

1 (5) 
Z j = P j (l-P 1 Pj-i) , 3 = 2,. --n. 



which is equivalent to 

Pi = Z 1 

"i-i 



P 3 — Z 3 



,k=l 



9 (6) 

3 = 2, ■■■n. 



We may add Z n+ i = 1 which is equivalent to P n+ i = \ — P\ — ■ ■ ■ — P n 
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It is possible to define the infinite model: {Zj)j>\ and {Pj)j>\ connected by ([5]) and 
In that case, Ylj Pj = 1 is equivalent to Ylji^- ~ %j) = 0- Actually, it can be thought as a 
sequential procedure to generate an element of S n (or Soo) viewed as a partition of [0, 1] into 
segments. The value Pi = Z 1 gives a bisection [0, Pi] U (Pi, 1] of [0, 1]. To the rightmost 
segment we perform a new bisection in proportion Z 2 , so that (Pi, 1] gives (Pi, P 2 ] U (P 2 , 1] 
with P 2 = Z 2 (l — Pi), and so on. 

They are essentially two ways to provide these variables with probability distribution, 
starting either from the Z's or from the P's. The common feature of all popular random- 
izations is the independence of the Z variables. In the elementary model Zj is uniform on 

[0, 1] for every j < n. The model was extended successively to Zj = Beta(l, 9) with 9 > 

(it is the so called GEM(6 I ) model), and later to Zj — Beta(l — a, 9 + ja) for 9 > —a and 
< a < 1 (it is the so called GEM(a,#) model). The bibliography in |Fen06j is rather ex- 
tensive. Besides, for biological and Bayesian statistical motivations, Connor and Mosimann 
|CM69j . assumed 

Zj = Beta(aj, bj) , j = 1, • • • , n 

with Z Tl+ \ = 1, and where a\, ■ ■ ■ ,a n ,b\,--- ,b n are positive numbers. They noticed (formula 
14 p. 199) that the density of P = (Pi, • • • , P n ) on S< is 



1 n 

££>a,b(pi, • • • ,Pn) := ^ Pn+l ( 1 ~ ^ Pj'" 1 - 

where p n +i — 1 — pi — ■ ■ ■ — p n and 

zip, b) - TT r ("i> r fe> 



b J-l-( a 3+ b j) 



is the normalizing constant. 

They called this distribution the Generalized Dirichlet distribution of parameters a = 
ax, ■ ■ • a n and b = b±, ■ ■ ■ ,b n . We recover the Dirichlet distribution Dir n (ai, • • • , a n ; b n ) when 
the parameters satisfy the relations bj-i = aj + bj i.e. 

bj = a j+ i H h a n + b n j = 1, • • • , n - 1 . (7) 

The two following properties are consequences of the construction ([6]) (see also |Won98j ): 

1) pw = (Pi,... ,p fc ) £ gv a(k)Mk) 

2) For every k = 1, • • • n — 1, conditionally upon Z\, ■ ■ ■ Z^, 

Pfe+l Pn 



1-Pl Pk '1-Pl Pk 



QV a 



fc+l,"' , a n,Ok + l, 



All the above models, where we provide each Zj with a Beta distribution with prescribed 
parameters, yield QT> distribution for the corresponding vector P. Conversely, it is known 
that if P is uniformly distributed on the simplex <S<, then Zj is Beta(l, n — j + 1) distributed 
for j < n. The QT> distribution has a more general covariance structure than the Dirichlet 
distribution. 

In the Section 12.61 we carry out the same construction for l v ball. 
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2.3 Real canonical moments 

In this section, we recall some interesting objects related to moment spaces. In [Kno57j, 
aiming to extend Brunn-Minkowki's theorem to convex bodies, Knothe introduced a general 
coordinate change. Skibinsky |Ski67] used this tool in the context of moment spaces. His goal 
was the study of some geometric aspects of these sets. In this context the new coordinates 
are called canonical moments. These quantities play an important role in moment problem 
theory. Indeed, they appear in many topics such as the orthogonal polynomial recurrence 
relation, the Stieltjes transform (and its expansion in continued fraction), etc... Actually the 
canonical moments seem to be more intrinsically related to the probability measures than 
the algebraic moments. In Section 14.11 we present the canonical moments for complex mo- 
ment space. Although a geometric construction is possible we define them using orthogonal 
polynomials on the unit complex circle following Simon in [Sim05bJ. We refer to the excellent 
book of Dette and Studden [DS97J for a complete overview on canonical moments. In next 
section we will carry the same geometric construction to £ p balls. Recall that we denote 
by A^iQO, 1]) the set of all probability measures on [0, 1] and by Mp 1] the N-th algebraic 
moment space generated by probability measures on [0,1] (see [p. Let ji G A / li([0,l]), we 
define, for n > 1, 

c n+i G") = max { r eR: (mi, ■■■ ,m n ,r) e M [ ^\ j 
c n+i G«) = min jr G K : (m 1 , ■ ■ ■ , m n , r) G AfjJ$ } , 

where (mi,m2, ...,m n ) is the vector of n first moments of /i. The first canonical moment is 
Ci = rri] and, for n > 1, the n + 1-th canonical moment is defined as 



c ti+i(aO 



whenever c,^ +1 (/i) > c~ +1 (/i). The last condition is verified if, and only if, 

(m 1 ,m 2 , ...,m n ) G intM^ ' 11 

Obviously, the canonical moments depend on [i just through its moment vector. Thus, 
given (mi,m2,...,m n ) G intMl ' 1 ^ the vector of n first canonical moments is completely 
defined. Furthermore, the mapping (mi, m 2 , m^) v-¥ (ci, C2, Cat) from intMjy' 1 ' onto 
(0, 1)^ is bijective and triangular in the sense that for every k < N, Ck depends only on 
(m 1 ,m 2 ,...,m lt ) and not on m k+1 , ■ ■ ■ ,m N . The range sequence is given by the following 
relation due to Skibinsky (see [Ski67j or Theorem 1.4.9 in [DS97J) 

i-i 

4(1*) -crM = n^'M ( : _c i(^))' i = 2,3,...,iV. 

From this relation, it follows that the bijective mapping m i— y c from int m]^' 1 ^ onto (0, 1)^ 
is a diffeomorphism whose Jacobian is 

p., \ AT-l 

= n fed - ■ <8) 

0»(Ci, • ■ ■ ,C N ) ^ 
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The Jacobian in (JS} leads to the following result. 

Theorem 2.1 (Chang, Kemperman, Studden). If m]^' 1 ' is endowed with the uniform dis- 
tribution, then the random canonical moments Cj j — 1, • • • , N are independent and 

e dx) = (2 ( ^~_ 2j j) | )2 1)! - x) N ~ J MoA)(x) dx. 

It is Theorem 1.3 in |CKS93j (see also p. 305 in |DS97j ). In other words Cj is Beta(iV — 
j + l,N-j + l) distributed. 

2.4 Canonical coordinates on the unit real ball 

Following the last geometric definition of canonical moments in Mjy' 1 ^, it is possible to 
build similar quantities on the l v ball. To begin with, let for iV > 1 and 1 < p < oo let Bjv, p 
(resp. B 1 ^ ) be the the unit p-ball of K, (resp. on C). That is, 

{/ n \ Vp 

x = (x ll x 2 , -,x N ) G R N : \\x\\ p := I ^ \xi\ p J < 1 

Extending the previous notation, Mn j00 will denote the unit open l°° ball, i.e. (—1, 1)^. For 
x = (xi, X2, xn) G M. N and 1 < k < N, set = (xi, 22, x^), i.e. the subvector of 
x built by its first k coordinates. By convention, we set ||^^||p := 0. We introduce the 
canonical coordinates c = (ci, . . . , cjy) G (—1, 1)^ of x G Bjv iP 

Ci = Xi, 



c fc = ; Xk „ ==, k = 2,3,...,N. ^ 



Knowing x^ k 1 \ we see that ^— (l — ^ ^ p , (l — \\x^ k ^Up) 1 ^ is the admissible 
range of x k in order that x lies in B^p- Let us denote by Cn the mapping from Bjv,p 
onto (—1, 1)^ which associates to any x G Bjv,p the point c = (ci, • • • , cjv) defined in 
The following key property is straightforward. 

Lemma 2.2. T/ie mapping C is a triangular C l -diffeomorphism, its inverse is given by 



[C~\c)\ k = c k f/(l-\ Cl \p)(l-\c 2 \p)---(l-\c k - 1 \p), k = l,2,---,N, 
the Jacobian matrix is lower triangular and its determinant is 



N 



d(x ir -- ,X N ) _ yj n ^lf=k 

d(c ir -- ,c N ) , i 



niV — fc 
(i-| Cjfc |f)— , ( Ci ,---,qv)g(-i,i; 
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2.5 Extension to the unit complex ball 

We extend the previous construction to the complex framework. First define 

TV \ V* 



B N,p ■- 



Z 



[ Zl ,z 2 , z N ) e C N : \\z\\ p = N P <1>- (10) 



The canonical coordinates are now 

Ci = Zi, 

c k = , Zk k = 2, 3,..., N 

where for z = z 2 , z N ), as in the real case, we set z^ = (zi, z 2 , z k ), k = 1, 2, N. 
Now the canonical coordinates belong to D. Conversely we have 



z k = c k f/(l - | Ci |p)(1 - |c 2 |p) ■ ■ ■ (1 - \c k ^\p) k = l,2,---,N. (11) 
We now point up the case p = 2. Consider the bijection T : — > R 27V , 

T(x 1 + iy 1? s/v + iyjv) = (^l, 2/1, ^iv, 2/jv)- (12) 

Only for p = 2 we have T (B^ ) = B 2 /v.p- However, the image of the canonical coordinates of 
z E B^ 2 by the previous bijective map are not, in general, equal to the canonical coordinates 
of the image of z. 

2.6 Sampling on the £ p ball 

2.6.1 Simplex and balls 

The simplest way to sample in the ball is to use the uniform distribution. So that, the 
stick-breaking scheme gives a first method to sample uniformly in Btv, p . As a matter of fact, 
it is enough to sample independent beta random variables and perform the change C^ 1 . This 
procedure is inherited from the sampling Dirichlet distribution as proposed in the book of 
Devroye (Theorem 4.2 p. 595 of [Dev86j chapter 11 - see also the error file). The following 
lemma gives a first connection with the simplex. 

Lemma 2.3. If X := (X±, ■ ■ ■ ,Xn) is uniformly distributed in Mn, p , then 
1. 

(Xi, ■ ■ ■ , X N ) = ^£ 1 p^ p , • • • , £7vP/v P ) 

where (pi, • • • , p^) follows the Dir N (p~ 1 , ■ ■ ■ , p -1 ; 1) distribution on and the e ? s are 
independent, Rademacher distributed and independent of (pi, • • ■ ,p n )- 

2. \\X\\v is Beta(N/p, 1) distributed. In other words 

II Y\\P — U l / N 
II Hp 

where U is uniform on [0, 1]. 
1) is direct via a change of variables and 2) is a genuine application of 
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2.6.2 ^-Generalized Dirichlet distributions 

Extending the previous lemma to generalized Dirichlet distributions is quite natural. 

Definition 2.4. Let aj, bj,j — 1, • • • , N be positive real numbers. We say that X follows the 
p-Generalized Dirichlet distribution on Btv, p with parameter (a, b) = (ai, ■ • • , a^, b\, ■ ■ ■ , b^) 
whenever 

(Xi, • • • , X N ) = (eip i y P , • • • , enPn^ 

where (pi, ■ • • , pn) follows the QV^ distribution on and the e's are independent, Rademacher 
distributed and independent of (p\, ■ ■ ■ ,pn)- 

From the Section I2.2[ we deduce that the p-Generalized Dirichlet distribution with pa- 
rameter (a, b) has a density with respect to the non-normalized Lebesgue measure on Bjv, p 

N N 

H a>h (x) = |- (1 - Hxl^)^- 1 J] (1 - Wx^X)^ 1 '^ \^\ pai ~ l ■ (13) 

Aa ' h ' 3=1 

Let us now go back to the parametrization of the balls developed in above section. Using 
equations and (II ip in the context of i v ball we recover the canonical representation: 

d = X, 



and 



c 3 


= *i( 


i-\\x^\\i) 


-i/p 


J = 2, • 


••iV. 


(14) 




= G x 
















"i-i 


i/p 








X j 


= C 3 


Ho--\Ck\ p ) 




J = 2, ■ 


■N. 


(15) 






.fe=i 











Here we easily get 

Proposition 2.5. For aj > 0, 6j > 0, j — 1, • • • , N, assume that X has the p-Generalized 
Dirichlet distribution with parameter (a, b) on Bjv, p one? se£ C := Cn(X) = (C\, ■ ■ ■ ,Cjy) 
(see Lemma [Oj) . T/ien 

i/ie random variables C\, ■ ■ • ,Cn are independent, 

ii) forj = 1, • • • , N, Cj = EjZ l J p , where Ej and Zj are independent, Sj has the Rademacher 
distribution, and 

Zj = Beta(aj, bj) . 
In other words, the density of Cj is 

¥{Cj e dx) = £)mV " rl ~ dx ■ 
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When the relation (J7J) is satisfied we have 



T) N ■ _i 

#a, b (x) = |- (i - ikua " n i^r* -1 • ( ie ) 

In that case, the family of distribution may be indexed by N+l parameters (aj., • • • , a/v, fr/v)- 
This family is stable by permutation of the coordinates, which is quite interesting. 
The following result is the counterpart of Theorem 12.11 



N 



Corollary 2.6. The density H a ^ is uniform on B^p if, and only if, for every j — 1, • • ■ , N 
dj = 1/p and bj = 1 + (N — j)/p, or equivalently 

Zj = Beta -, H 

\P P 

(*=** + 1) 

P(C 7 - G dx) = 7 v / 7 r- (1 - \x\ p )~ \, XVs (x)dx. 

2r ( i + 1^ r ( ^ + i x 



2.6.3 Some other sampling schemes 

There are two other ways to sample uniformly in B/v, p - They are issued from the polar 
decomposition. We give them for sake of completeness and because they will be useful in 
settling some limit theorems. 

Let <9Bat iP be the boundary of Mn, p (also called the sphere). The polar decomposition is 
the mapping 



7r : R \ {0} ->■ (0, oo) x dM NtP 



x 



x i — y n(x) = \\\x\\ p , (f>(x) :- 
Let A A? be the Lebesgue measure of M N and set 

Vn, p '■— Ajv(BjVp) = 2 -, r — . 

r(i + ?) 

Let hb be the cone (probability) measure defined by 

/is(A) = Ajv([0, 1]A) (A measurable subset of <9B/v, P ), 

where 

[0, 1]A := {r0 : r G [0,1], G A}. 
In other words, we have for any test function / the decomposition ( |NR03j Proposition 1) 



oo 

jV-1 



f(x)dx = NV N , P I r™- 1 / f(r ■ <p)dfJL B (<p)dr. 
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The probability /xb is the image of the normalized Lebesgue measure on Mn jP by the mapping 
<p. Notice that the cone probability and the surface measure are proportional if and only if 
p = 1 or 2. 

Lemma 2.7. The pushing forward of \ib by the mapping 

Oi, • • • ,x N ) e dE NtP i y Oi| p , • • • , \x N \ p ) e S N 
is Dir^ip" 1 )- Moreover, for 1 < k < N, 



\P V ) 



The following lemma is well known. 

Lemma 2.8. Ifr/( N ^ := (771, ■ • • ,i]n) has a distribution depending only upon \\r]^ N ^\\ p without 
any mass in 0, then 77^7 11*7 Hp ^ s independent of ||f7^|| p and its distribution is the cone 
probability hb on dB>N tP . 

Let £( N > := (£1, ■ • • , ^tv) be uniformly distributed in M N>p . Let (gk)k>i be an i.i.d sequence 
having G p distribution (see Section [2~T1 ) . Further, set = (gx, ■ ■ ■ , ^at). There are two 
methods to generate a uniform sampling ^ N > in the ball, both using a draw of G^ and an 
extra independent variable. 



A) According to Lemma 12.81 



is independent of ||G^|| P and is /zg distributed on dM NjP . From Lemma l2"l?l 2). we claim as 
in [CDT] that 

A xjVn.qW^ (17) 

where U is uniform on [0, 1] and sampled independently of (gk)k>i- 
B) In |BGMN05j . it is proved that 



(\\GW\\ P P + Z) 1/P 



where is as above and Z is exponentially distributed and independent of G^ N \ Let us 
give a direct proof here since it is easy in our framework. Writing the righthand side as 

G (N) G (N) / \\GW\\P N ' " 



{ \\GWf p + Z ) 1/p \\G( N % \\\G^% + . 

we see from Lemma [2.81 that the two factors are independent. From the additive property 
of Gamma distribution, is 7(iVp _1 ) distributed. So that, we can apply (BJ and get 



" G " A '" {: = Beta(iVp- 1 ,l), 



\\GW\\ P P + Z 



The conic measure is misnamed there "surface measure" 
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and taking the power 1/p, this is exactly the required radial distribution (see Lemma [2731 
point 2.). 

Recently, various authors ([B GMN05] . |Nao07] ) were interested in the dependence struc- 
ture of coordinates when sampling randomly in the unit ball or on the sphere. In this section, 
we give direct proofs of these results, carrying the known properties of Dirichlet distributions. 
In Lemma 2 of [N ao07] (with references to analytical and geometric proofs in older papers), 
Naor proved that the coordinates of a random sampling on the sphere are negatively upper 
orthant dependent. To be more precise, let us recall two definitions ( [ JDP83] ) . 

Definition 2.9 (Joag-Dev, Proschan). 1. Random variables Ux,--- ,Uf- are said to be 
negatively associated (NA) if for every pair of disjoint subsets A\,A 2 of {1, • • • , k} 

Cov{A([/m g A l ),f 2 (U j ,j e A 2 )} < 0. 

whenever fx and f 2 are increasing. 

2. Random variables Ux, ■ ■ ■ , £4 are said to be negatively upper orthant dependent (NUOD), 
if for all real Xx,- ■ ■ , Xk, 

k 

F(Ui>Xi,i = l,--- ,k) < Y[F{Ui>Xi). 

i=l 

or equivalently, if 

(k \ k 

n ) <n E (^))- 
i=i y i=i 

for fx, • • • , f n increasing positive functions. 

Some authors called property (j2j) sub- independence. NA is strictly stronger than NUOD. 

Proposition 2.10. If £ — (£i, ■ • • , £ n ) follows a Dirichlet distribution on S< or S n , then 
the variables ■ • • , £ n are NA. 

This property is claimed for Dirichlet distributions (without precision) in ([JDP83J). 
Nevertheless, for the sake of completeness, we present a new short proof here. 

Proof. In a Dirichlet distribution the permutation of variables or the elimination of few of 

them yields still a Dirichlet distribution, thus it is enough to show that iiU = Dir n (ai, • ■ ■ , a n ; a, 
then for r < n and, / and g increasing functions 

E [f(Ux, ■■■ , U r )g(U r+1 , ■■■,U S )}< Ef(Ux, ■■■ , U r )Eg(U r+l , ■ ■ ■ , U n ) . 

Conditioning by S = U\ + ■ ■ ■ + Uk and using ((3]) it follows 

E [f(Ux, ■■■ , U r )g(U r+1 , ■■■ ,U n )]= EF(S)G(S) 

with F{S) = E s f(Ux, ■■■ , U r ) and G(S) = E s g{U r+1 , ■■■ ,U n ) (here E s denotes the condi- 
tional expectation with respect to S). If we write U as 

U = (S£x, • • • , S£ r , (1 - S)Ui, •••,(!- S)Q 
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it holds 

(fi, ■• ■ >&•) = Dir(ai, • ■ ■ ,a r ) 

(£r+i> ' ' ■ ; Cn) = Dir n _ r (a r+ i, • • • , a n ; 

S = Beta(ai + • — h a r ; a r+ i + • ■ ■ + a n+ i) 

and these three variables are independent. It follows then F(s) = E/(s£i, • • • , s£ r ) and 
G(s) = Eg((l — s)£i, ■ ■ • , (1 — s)£ r ). The function F is increasing and G is decreasing, then 
applying the usual correlation inequality for a single variable yields 

EF(S)G(S) < EF(S)EG{S) 

and we obtain EF(S) = Ef(Ui, ■ ■ ■ , U r ) and as 1 — S = Beta(afc + i + • • • + a n+ i, a±-\ — • + a r ) 
we recover by product EG(S) = Eg(Uk+i, ■ ■ ■ , U n ). ■ 

In [BP98j and pBGMN05] (see Theorem 6 and Lemma 5), the NUOD property was proved 
for the coordinates of a random sampling in the ball. Actually we can revisit these statements 
for a larger class of distributions and extend them to the NA property. 

Theorem 2.11. 1. If X = (X±, ■ ■ -Xn) has the density of (fIBi) in the ballM^p, then the 
variables \X±\, • • • , \Xn\ are NA. 

2. If X = (Xi, ■ ■ • Xn) has the density 

i N 

on the sphere dM^^ (where Z a is the normalizing constant), then the variables \Xx\, ■ ■ ■ , \Xn 
are NA. 

Let us stress that the uniform distribution in the ball (resp. on the sphere) satisfies the 
assumptions of the last theorem. 

Proof. Let & = for i = 1, • • • ,N and f = (ft, • • • , &v)- 

1) When X has the density i? a ,b of f|T6|) . a small change of variables yields 

f = Dirjy(ai,-- - ,a N ;b N ). 

We saw at the end of Section [2TT1 that the variables are NA, and then the variables \X^\ 
inherit the property. 

2) We have 

(fi, • • • >6v) = Dir(ai, • • • ,ajy) 
and we get the same conclusion as above. ■ 
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3 Asymptotics for ^-generalized Dirichlet distributions 



In this section, we use the previous representations to obtain asymptotic results (as 
N goes to infinity) for the finite dimensional projections of random element of Bn. p - We 
consider both convergence in distribution and large deviations. 



3.1 Convergence in distribution revisited 

3.1.1 Poincare-Borel for the uniform distribution on the ball 

With the representation (|17|) we get easily the Poincare-Borel like lemma (see Lemma 1.2 
of [Led96] and [DF87j for historical account). As before, (gk)k>i is a sequence of independent 
G p distributed random variables and = (gi, ■ ■ ■ , g^). For every N > 1, set 

G (N) 



IIGWIIp" 

Now, if rf N } := (r]i, ■ ■ ■ ,T]n) is uniformly distributed on the l v sphere <9Bat jP , we have 

N*r,W = = ( — w— — ) ' . 

If £} N i := • • • , £jv) is uniformly distributed in the ball M^ :P , we have 

Np^= U 1/N Nr<f) {N \ 
By the strong law of large numbers, 



Efe=i \9k\P 



a.s. 



N 



(E\ 91 \T 1/P = P 1/P - 



Moreover U N converges to 1 in distribution. This yields a p-version of the classical form 
of the Poincare-Borel's lemma: 

Proposition 3.1. If k is fixed, and if denotes the projection on the k first coordinates, 
then, as N — > oo, 



3.1.2 Poincare-Borel for generalized Dirichlet distributions 

Let X and C be as in Proposition 12.51 and k be a fixed positive integer. We will first give 
a result on the asymptotic behavior, for large N, of {N l / p X^) (k > is fixed). Our proof 
uses the canonical representation of the ball and is quite simple. 
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Theorem 3.2. Let k > 1 be fixed. Assume that, for j = 1, ■ ■ • , k, pbj = N + o(N). Then, 
where e±, ■ ■ ■ ,e k ,Z\,--- , Zk are independent, the e's are Rademacher distributed and for 



d 



1 



In other words £jZ l J p has the density 



j 

1— a,- 



f{e J Z) ,p G dx) = ^-—\x\ pa ^ 1 exp{-\x\ p /p)dx, (x G 



The proof is a straightforward consequence of the following useful lemma, whose proof 
follows the representation (jl]) and the law of large numbers. 

Lemma 3.3. Let 9 > and c{9) such that, as 6 — >■ oo ; lim = c > 0. Then, for every 
a > 0, 

Beta(a, c{6)) — > 7(0, c) . 
Proof of Theorem \3.2 . The above lemma yields for j < k 

N i/ PC M+z ( y p)} 

n 

and by independence we get the convergence in distribution of jV^C^. Further using ([9]) 
we may write 7V 1/p X (/c) = A^ 1/p C (fe) + o P (l) and may conclude. ■ 

3.2 Large deviations 

Let us turn out to the large deviation companion theorem of Theorem 13 .21 For the basic 
notions on large deviations, (definitions, contraction principle...) we refer to [DZ98J. 

Theorem 3.4. Under the assumption of Theorem \3.& (X^ k ') satisfies a large deviations 
principle (LDP) with good rate function 

I( x ) = -l\n(l-\\xZ) , (xEM k)P ). 
p v ' 

To prove this theorem we begin with the canonical variables. 

Proposition 3.5. Under the assumption of Theorem \3.§l (C^ k ') satisfies a LDP with good 
rate function 



J{c) = — In ff(l - |qH , c = ( Cl , c 2 , • • • , c k ) E (-1, l) fc . 
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The proof of this proposition is built on the following useful lemma. 
Lemma 3.6. Let a > 0, c > 0,9 > and c(9) > such that c(6)/6 — > c as 9 — > oo. Let 

Y e = Beta(a,c(0)) 

Then, when 9 — > oo, the family of distributions of (Yg) satisfies the LDP on (0, 1) at scale 9 
with good rate function: 

J(x) = — clog(l — x) . 

This result appears with variants in the literature (Lemma 3.1 in |DF06j. Lemma 2.1 in 
[Fen06j ) . It is proved therein with a direct computation. We give here another proof that en- 
lightens the role played by the exponential distribution in all LDP about Beta distributions. 
A slight variant of this proof was used once in Lemma 4.3 in [GLC04J. 

Proof of Lemma \3. (A We start with the representation (J3J): 

d 7(a) 



7 (a)+ 7 '(c(0)) 



where 7(a) and 7'(c(#)) are independent. First, observe that the family of distributions of 
6 l ~ 1 7 (a) satisfies the LDP on (0, 00) with rate function 

I (x) = x. 

Then, observe that the family of distributions of # _1 7 (c(9)) satisfies the LDP on (0, 00) with 
rate function 

Jo(x) = cx — 1 — log(cx) . 

By independence, the pair (# -1 7 (a), 9~ 1/ -/' (b(9))) satisfies the LDP with rate function (x±, x 2 ) 1— > 
Io{xi) + ^0(^2)- Since the mapping 

(0, cx)) x (0, 00) ->- (0,1) 
(x,y) h-> 



x + y 

is continuous, the contraction principle gives an LDP for (Yg) with rate function 
J(x) = inf <^ I (xi) + J (x 2 )] — — — = x\ = -clog(l - x) , 

y xi + x 2 J 

which ends the proof. 



Proof of Theorem \3.5[ Since the coordinates of are independent the LDP will follow 
from the LDP of each Cj (j = 1, 2, k). From Proposition 12.51 observe that 

Cj — ^jZ^ p , Zj — Beta(oj,6j) . 
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For j fixed, we may apply Lemma 13.61 with 

b- 1 

a = cij ,9 = N, b{6) = bj,c = lim = -, . 

We get an LDP for Zj on (0, 1) with rate function x t— > —p^ 1 log(l — x). One easily deduces 
from this that Cj satisfies the LDP on (—1, +1) with rate function x h-> — p^ 1 log(l — \x\ p ). 
By independence, the vector C^ k > satisfies the LDP on (0, l) k with rate function 

k 

(c X) •■■ ,c k )t->^2 -p' 1 log(l - \c k \ p ) . 

3=1 



Proof of Theorem\3.4\ LDP for follows directly from the contraction principle. The 



good rate function is given by the relationship 

I{x) = J(C k (x)), x e B feiP . 
Furthermore, from the definitions of the canonical coordinates (see (jUJ)) it follows that 

k 



Ho- - ^ = 1 



l-MP 

I"" lip ! 



1=1 



which allows to conclude the proof. ■ 

The £ 2 -ball and a functional LDP The £ 2 -ball case is quite peculiar, due to its connec- 
tion with the functional unit ball 

B 2 := {/GL 2 ([0,1]):||/|| 2 <1}, 

of the Hilbert space L 2 ([0, 1]). This allows to extend somehow the LDP of Theorem 13.41 In 
this section, we will give a functional LDP companion result of Theorem 13.41 Let (e n ) n >i be 
any orthonormal basis of L 2 ([0, 1]). So that, we may rewrite 

M Nt2 := ^{x 1 ,x 2 ,...,x N ) eR N :xi = (f,*), % = 1,2,.. .,N, f G £ 2 j. 

Let the random sequence (F N ) of B 2 be defined by 

N 

FN = X 



N 

1=1 



where the sequence (Xj) satisfies the assumption of Theorem 13.21 The following Theorem 
follows directly from a projective limiting procedure (see [DZ98] Section 4.6). 

Theorem 3.7. The sequence (Fn)n satisfies a LDP in B 2 with good rate function 

w) = -^(i-ii/y. 
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3.3 Donsker limit theorems 



Come back to Proposition I3.1[ in a first extension we can take k = k(N) and show that 
the distance in variation between the law of tc^^Npt]^) and G® fc tends to as soon as 
k(N) = o(N) |NR03l Theorem 3]. The Euclidean case (p = 2) is treated in |DF87j (see 
|Jia06] for related results). Besides, since we may write for every N > 1 



n^N^l^v^seio,!}) £ • [N-^9k,se [0,1] 



we can deduce the convergence to the standard Brownian motion {W s ,s G [0, 1]}. It is the 
classical Donsker's theorem for self-normalized processes. It holds actually under very weak 
assumptions (see |CSW03j ). In the same vein, owing to the results of Shao ( |Sha97j ) the 

family — (j2k=i Vkj satisfy the LDP with good rate function 



i{x) = mf sup t\x\( P - i)c* - log ^incy-iit^i rj ; dy 



c>0 



t>0 



exp (t (cy — (1 + 


x\)\y\ 




2P (i + §; 


) 



One may think that the LDP holds also for the sequence of processes — J2k^i £k, s G [0, 1] J 

T((p)= [ I(<p(8))d8. 



with the rate function 



3.4 The i 1 ball and the GEM(q, 6) distribution 

In a nice paper, Dawson and Feng ([DF06], Theorem 4.3) proved that the LDP holds in 
Bjv,i when the underlying canonical variables are Beta (1,9) and let 9 — > oo. It is exactly a 
particular case of our Theorem 13.41 with p — 1, /3 3 - = and a.j = j/p and 9 = N/p. In their 
Theorem 4.4, they extend the LDP to B^i with rate function defined on by 

h(x) = -log(l - ||x||i). 

It is the so called GEM(#) model. It is exactly a particular case of our Theorem 13.41 with 
p — 1, (3j — and atj = j — 1 and 9 = N. 

In another paper Feng |Fen06] proved the LDP when the k-th canonical variable is 
Beta(l — a, 9 + ka) distributed. They obtained the same rate function. It is the so called 
GEM(a;,^) model. It is exactly a particular case of our Theorem 13.41 with p — 1, (3j = —a 
and aij = (a + l)j — 1 and 9 = N. 



4 Moment spaces revisited 

4.1 Moments: the complex case 

All this subsection comes from the book of Simon |Sim05bj Section 1 or |Sim05a] Sections 
2 and 3. We recall here the connection between moments of a probability measure on the 
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torus T and canonical moments built through orthogonal polynomials. To begin with, let \x be 
an arbitrary nontrivial (that is not supported by a finite number of points) probability on T. 
The functions 1, 2, Z ? ■ • • £1X6 linearly independent in L 2 (T, dp). Following the Gram-Schmidt 
procedure we define the monic orthogonal polynomials ($ n )- More precisely, $o(z) = 1 and 
for n > 1, & n ( z ) is the projection of z n onto {1, • • • , z™" 1 }- 1 . If \i is supported on the finite 
set {zi, ■ ■ ■ ,zn}, we still define $fc until k = N — 1. We define $at as the unique monic 
polynomial of degree N such that \\&n\\ =0 i.e. 

N 

®n(z) = ]J(z - Zj) . 

3=1 

Some useful polynomials associated to the sequence ($ n ) are the reversed (or reciprocal) 
polynomials. They are defined by &q{z) = 1 and 



= z n $ n (l/z). 

Notice that $* is the unique polynomial of degree at most n, orthogonal to z, z 2 , ■ ■ ■ , z n and 
such that $*(0) = 1. We now define a quantity which appears to be central in our paper. 



Definition 4.1. For j e N, we define the canonical moment Cj := — <& 3 -(0). 

In the sequel, when it will be necessary to make precise that the canonical moment 
depend on the underlying measure /i, we will sometimes also write Cj(p). The coefficients Cj, 
j > are called Verblunsky coefficients by Simon. They are also named after Schur, Szego, 
or Geronimus |Ibra68j . They are sometimes called reflection coefficients Burg75 . One of 
their properties is recalled below without proof for further use 

Proposition 4.2. 

n+l 

pn +1 ir=(i-k +1 i 2 ) ii*„ii 2 = n (1 - M a ) . 

3=1 

Consequently, if fi is nontrivial, c 3 G D for every j > 0. Further, if the support of p 
consists in N points, then Cj G D for 1 < j < N — 1 and cn G dB>. A theorem due to 
Verblunsky asserts that the correspondence between p and the sequence of its canonical 
moment is a bijection. The Verblunsky's formula ( |Sim05b] Theorem 1.5.5) claims that for 
each N, there is a polynomial V^ N \ci, ■ ■ ■ , c/v-i, c , • • ■ cn) with integer coefficients so that 
the moments {£„}„ of \i satisfy 

. N-l 

t N := J z N p(dz) = c N 11 (1 - |c/) + V^ N \c l} • • • , Cjv -i, Co,--' c N -i) ■ (18) 



3=1 



Conversely, is a rational function of • • • , tjv-i, *jv-ij Moreover, as remarked by 
Simon ( |Sim05b] Section 3.1), formula f|T8|) tells us that canonical moments measure relative 
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positions of t n among all values consistent with Co,Ci, ••• , c n _i. To be more precise, for 
n > 1, set 



Ml=\(f z^(dz)) : /i G A4i(T 

I V</ / l<j<N 



where A^i(T) denotes the set of all probability measures on T. Then, given (ti, ■ ■ ■ , tjv) £ 
Mjj, the range of the (N + l)th moment 

t N+ i = J z N+1 d V (z) 

as 77 varies over all probability measures having (ti, • • • , £jv) as A" first moments, is a disk 
centered at sn+i = V^ N '(ti, ■ ■ ■ , ijv,£i, • • -£jv) with radius 

N 

rN+i = JJ i 1 - l c i! 2 ) 
i=i 

(by Verblunsky theorem, these quantities only depend on the prescribed N first moments). 
If r N+ i y£ 0, the relative position is 

t N+1 - s N+1 g R 

A very nice result is that the above quantities are the canonical moments cn+i of \x (see 
[DS97]). So that, as pointed out in |DS9 7]. canonical moments may be built both geometri- 
cally or algebraically. 



4.2 Moment space and generalized Dirichlet distribution 

In this section we discuss the connection between randomized balls and randomized 
moment spaces. Indeed, the asymptotic results in [CKS93] . [GLC04] and |LC05] are in the 
same spirit as those obtained here. In an early version of the present paper, we wrote a 
result exhibiting a natural way to push forward the uniform measure on complex moment 
spaces towards the uniform one on the complex Euclidean balls. Unfortunately, this nice 
result is not true! It was based on a wrong Jacobian computation for complex moment space 
performed in Lemma 7.3 in |LC05] where a factor 2 is missed in the exponentiation. 



Complex moments and Euclidean balls The aim of this subsection is to underline a 
natural connection between the moment space Mjj and the Euclidean ball. This will done 
using canonical moments. To begin with, let us go back to the sequence of orthogonal 
polynomials for the probability measure \i. Let be an integer such that the support of \x 
has cardinality at least A/" + 1. Here, we will normalize the orthogonal system in a different 
way than in Section 14.11 by setting: 

y?o = 1, <Pn — TT~!rip n = 1,2, N. 
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We also define the associated reversed polynomials: 



,1 TV II 

Since (p* N is a polynomial of degree and since ||y?^|| 2 = llv 9 ^!! 2 — 1> we have 



N 



fc=0 

Set 

7T fc := (Lf* N ,Lp k ), k = 0,1, N 
the formula (1.5.59) in |Sim05b| yields to the following lemma. 

Lemma 4.3. 

n 

~k = ~Ck i ] ] V 1 ~ \ C A 2 , k = 1,2,...,N, 

r=k+l 

where by convention Cq = —1. 

For the sake of completeness let us give a short proof. 

Proof. Let P be a polynomial of degree at most N. Recall that $^ is orthogonal to z, 
so that ($*, [P(z) - P(0)1]) = 0. Therefore 



(<1>^P> = P(0) y = P(0) y ^ JV $ JV (z)d//(z) = P{0)\\*n\ 

hence, taking P = ip k (for < AT) 

/ <J>* \ 11$ II 

The previous equality and Proposition 14.21 give 

N 



»ro= 11**11 =nv* 



r=l 



Now for 1 < k < N, by the same arguments we obtain 



a ^ ifedl - ll $jV H - IL=i I'H - TT /T 



jv r^ni N 



c, 



|2 



r=k+l 
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Now, from ([20]) and ffl~9]) we see that the point 

z= (zi,--- ,z N ) := (7Tjv, ••• ,7Ti) (21) 
lies in the complex ball B^ 2 (see ffTUl) below). Furthermore, setting: 

«V = -cjv+i-r, r = 1, 2, . . . , N 
we get from (T2T1) and Lemma [4.31 

Z\ = Ki 

3-1 

z i = «i II V 71 - Kl 2 , j = 1,2, ...,N. 

8=1 

Roughly speaking, the previous relation is the analogue for the ball of ffTSj) . Notice that the 
relationships (£&) -H- (cjt) and (zjt) -H- (ac&) are both triangular and bijective. They measure 
the relative position of a coordinate knowing the previous ones. 



The real case. 

diagram 



Finally, we discuss a result in the real case. Consider now the following 



(0,1)" 



»N,2 
(iii) 

hl) N 



(-1,1) 



where (i) is the canonical moments transformation, (ii) is the coordinatewise transformation 
t i — y *2t — 1, (iii) is the inverse canonical coordinate transformation and Sjy is obtained by 
composition of these transformations. 

Obviously, using Theorem 12. II and Proposition 12. 5[ the pushforward of the uniform prob- 

leads to the generalized Dirichlet Distribution on on 



ability measure on Mp 1 ^ by Tj N 



2 with di = Cl2 



a N 



1/2 and bj = N - j + 1, j 



1,2, 



N. 
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